Syntactically Coded Corpus of Spoken Lithuanian: Developmental Issues and Pilot Studies

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Syntactically Annotated Corpus of Japanese Spoken Monologue

Tomohiro Ohno∗ , Shigeki Matsubara†‡, Hideki Kashioka‡, Naoto Kato‡, Yasuyoshi Inagaki§ ∗Graduate School of Information Science, Nagoya University †Information Technology Center, Nagoya University ‡ATR Spoken Language Translation Research Laboratories §Faculty of Information Science and Technology, Aichi Prefectural University Furo-cho, Chikusa-ku, Nagoya, 464-8601 Japan [email protected]

متن کامل

Lithuanian-Latvian-Lithuanian Parallel Corpus

The goal of the paper is to present different problems related to the building of Parallel Corpus for two small languages, namely, Latvian and Lithuanian. The Lithuanian-Latvian-Lithuania Parallel Corpus (LILA) will contain 8 million running words; will be bidirectional, aligned on the sentence level. The problems include identifying, acquiring, preparing, and aligning parallel texts.

متن کامل

The ATIS Spoken Language Systems Pilot Corpus

Speech research has made tremendous progress in the past using the following paradigm: de ne the research problem, collect a corpus to objectively measure progress, and solve the research problem. Natural language research, on the other hand, has typically progressed without the bene t of any corpus of data with which to test research hypotheses. We describe the Air Travel Information System (A...

متن کامل

postnatal studies of bats (pipistrellus kuhlii and miniopterus schreibersii) & histomorphology and histochemistry studies of organs and diseases of (neurergus microspilotus and n. kaiseri)

1. to determine whether difference in birth body mass influenced growth performance in pipistrellus kuhlii we studied a total of 12 captive-born neonates. bats were assigned to two body mass groups: light birth body mass (lbw: 0.89 ± 0.05, n=8) and heavy birth body mass (hbw: 1.35 ± 0.08, n=4). heavier body mass at birth was associated with rapid postnatal growth (body mass and forearm length) ...

Computational and Linguistic Issues in Designing a Syntactically Annotated Parallel Corpus of Indo-European Languages

This paper reports on the development of the PROIEL parallel corpus of New Testament texts, which contains the Greek original of the New Testament and its earliest IndoEuropean translations, into Latin, Gothic, Old Church Slavic and Classical Armenian. A web application has been constructed specifically for the purpose of annotating the texts at multiple levels: morphology, syntax, alignment at...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Studies About Languages

سال: 2016

ISSN: 2029-7203,1648-2824

DOI: 10.5755/j01.sal.0.28.15131